Mining Sequential Patterns

Authors

  • Rakesh Agrawal
  • Ramakrishnan Srikant
Abstract

We are given a large database of customer transactions, where each transaction consists of customer-id, transaction time, and the items bought in the transaction. We introduce the problem of mining sequential patterns over such databases. We present three algorithms to solve this problem, and empirically evaluate their performance using synthetic data. Two of the proposed algorithms, AprioriSome and AprioriAll, have comparable performance, albeit AprioriSome performs a little better when the minimum number of customers that must support a sequential pattern is low. Scale-up experiments show that both AprioriSome and AprioriAll scale linearly with the number of customer transactions. They also have excellent scale-up properties with respect to the number of transactions per customer and the number of items in a transaction.
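
To make the notion of "customers that support a sequential pattern" concrete, the following is a minimal Python sketch of support counting only. It is not AprioriSome or AprioriAll. It assumes each transaction is a tuple `(customer_id, time, items)`, and that a customer supports a pattern when the pattern's itemsets appear, in order, as subsets of distinct transactions of that customer. The function names and the sample data are hypothetical and do not come from the paper.

```python
from collections import defaultdict


def customer_sequences(transactions):
    """Group (customer_id, time, items) rows into one time-ordered
    list of itemsets per customer."""
    by_customer = defaultdict(list)
    for customer_id, time, items in transactions:
        by_customer[customer_id].append((time, frozenset(items)))
    return {
        cid: [items for _, items in sorted(rows, key=lambda row: row[0])]
        for cid, rows in by_customer.items()
    }


def contains(sequence, pattern):
    """Return True if each itemset of `pattern` is a subset of a distinct
    itemset of `sequence`, with the order of the pattern preserved."""
    pos = 0
    for wanted in pattern:
        # Greedily skip to the earliest transaction that covers `wanted`.
        while pos < len(sequence) and not wanted <= sequence[pos]:
            pos += 1
        if pos == len(sequence):
            return False
        pos += 1  # The next pattern itemset must match a later transaction.
    return True


def support(transactions, pattern):
    """Count the customers whose transaction sequence contains `pattern`."""
    pattern = [frozenset(itemset) for itemset in pattern]
    sequences = customer_sequences(transactions)
    return sum(1 for seq in sequences.values() if contains(seq, pattern))


# Hypothetical sample data: (customer_id, time, items bought).
transactions = [
    (1, 1, {"A"}), (1, 2, {"B", "C"}),
    (2, 1, {"B"}), (2, 2, {"A"}),
    (3, 1, {"A", "D"}), (3, 3, {"C"}),
]

print(support(transactions, [{"A"}, {"C"}]))  # customers 1 and 3 -> 2
print(support(transactions, [{"B"}, {"A"}]))  # customer 2 only -> 1
```

Under these assumptions, a pattern would count as frequent when its support is at least the chosen minimum number of customers. An actual mining algorithm also has to decide which candidate patterns to count, which this sketch does not attempt.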

Similar resources

Distributed Sequential Pattern Mining: A Survey and Future Scope

Distributed sequential pattern mining is a data mining method for discovering sequential patterns in large sequential databases in a distributed environment. It is used in a wide range of applications, including web mining, customer shopping records, biomedical analysis, scientific research, etc. A large body of research has been done on sequential pattern mining in various distributed environments like Grid, Had...

A Framework for Mining Closed Sequential Patterns

Sequential pattern mining algorithms developed so far provide better performance for short sequences but are inefficient at mining long sequences, since long sequences generate a large number of frequent subsequences. To efficiently mine long sequences, closed sequential pattern mining algorithms have been developed. These algorithms mine closed sequential patterns which don’t have any super se...

Sequential Patterns Postprocessing for Structural Relation Patterns Mining

Sequential pattern mining is an important data mining technique used to identify frequently observed sequential occurrences of items across ordered transactions over time. It has been extensively studied in the literature, and a diversity of algorithms exists. However, more complex structural patterns are often hidden behind sequences. This article begins with the introduction of a model f...

Exploring multi-dimensional sequential patterns across multi-dimensional multi-sequence databases

Existing multi-dimensional sequential pattern mining methods only discover multi-dimensional sequential patterns in databases involving one sequential dimension. Since multi-dimensional sequential patterns may exist in databases containing more than one sequential dimension, in this paper we present the algorithm PSeq-MIDim for mining multi-dimensional sequential patterns from multiple sequential d...

Sequential Pattern Mining by Pattern-Growth: Principles and Extensions

Sequential pattern mining is an important data mining problem with broad applications. However, it is also a challenging problem since the mining may have to generate or examine a combinatorially explosive number of intermediate subsequences. Recent studies have developed two major classes of sequential pattern mining methods: (1) a candidate generation-and-test approach, represented by (i) GSP...

Multidimensional Sequential Pattern Mining

Data mining is the task of discovering interesting patterns from large amounts of data. There are many data mining tasks, such as classification, clustering, association rule mining, and sequential pattern mining. Sequential pattern mining is the process of finding the relationships between occurrences of sequential events, to find if there exists any specific order of the occurrences. It is a ...

Publication date: 1995